Tabular Statistical Disclosure Control: Optimization Techniques in Suppression and Controlled Tabular Adjustment
Abstract
All governmental statistical agencies face the problem of disseminating tabular data that provide enough information to satisfy public need while protecting individually identifiable data. The problem belongs to the field of Statistical Disclosure Control and poses many difficult policy and technical challenges for these agencies. To fulfill the dual mission of dissemination and confidentiality protection, agencies must balance conflicting objectives. Traditionally, agencies have relied on selective suppression of sensitive cells. Because suppressing optimally is difficult, and because publishing tables with omitted cell values can cause problems of its own, new approaches based on selective adjustment of cell values have been proposed. One such method is Controlled Tabular Adjustment, introduced by Dandekar and Cox [2002]. In this paper we discuss the theoretical, computational, and practical issues of these two approaches to Statistical Disclosure Control.
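As a rough illustration of the two approaches named in the abstract, the toy sketch below works on a single table row published with its total. It is not the method of Dandekar and Cox: the cell values, the function names `recover_suppressed` and `adjust_row`, and the rule of taking the compensation from the largest other cell are all assumptions made for this example. Real tables have many interlinked totals, which is why optimization models are used in practice.

```python
def recover_suppressed(row, total):
    """Try to recover a suppressed cell (None) by subtracting from the total."""
    missing = [i for i, v in enumerate(row) if v is None]
    if len(missing) != 1:
        # Zero or several suppressions: subtraction alone gives no exact value.
        return None
    return total - sum(v for v in row if v is not None)


def adjust_row(row, sensitive, protection):
    """Toy controlled adjustment (hypothetical rule, for illustration only).

    Moves the sensitive cell up by `protection` and takes the same amount
    from the largest other cell, so the published total stays unchanged.
    """
    others = [i for i in range(len(row)) if i != sensitive]
    donor = max(others, key=lambda i: row[i])
    if row[donor] < protection:
        raise ValueError("no single cell can absorb the adjustment")
    adjusted = list(row)
    adjusted[sensitive] += protection
    adjusted[donor] -= protection
    return adjusted


row = [12, 3, 40, 25]  # hypothetical cell values; cell 1 is sensitive
total = sum(row)       # published alongside the cells

# Suppressing only the sensitive cell is undone by simple subtraction.
print(recover_suppressed([12, None, 40, 25], total))

# A second (complementary) suppression blocks exact recovery in this row.
print(recover_suppressed([12, None, None, 25], total))

# Adjustment publishes every cell, with the sensitive one moved to a safe value.
print(adjust_row(row, sensitive=1, protection=5))
```

Even with two suppressed cells an attacker still learns their sum, so a real suppression pattern must also guarantee a sufficiently wide range of possible values for each sensitive cell.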
Similar resources
Recent advances in optimization techniques for statistical tabular data protection
One of the main services of National Statistical Agencies (NSAs) for the current Information Society is the dissemination of large amounts of tabular data, which is obtained from microdata by crossing one or more categorical variables. NSAs must guarantee that no confidential individual information can be obtained from the released tabular data. Several statistical disclosure control methods ar...
Statistical disclosure control in tabular data
Data disseminated by National Statistical Agencies (NSAs) can be classified as either microdata or tabular data. Tabular data is obtained from microdata by crossing one or more categorical variables. Although cell tables provide aggregated information, they also need to be protected. This chapter is a short introduction to tabular data protection. It contains three main sections. The first one ...
Mathematical Programming Models for Balancing Data Quality and Confidentiality in Tabular Data
1. Mathematical Programming Model for Controlled Tabular Adjustment (CTA)
Statistical agencies use different methods to protect the confidentiality of tabular data. The most widely used method, complementary cell suppression, suppresses both primary (sensitive) and secondary (non-sensitive) cells to assure confidentiality. Despite its popularity, it suffers from severe limitations. Complementar...
On Assessing the Disclosure Risk of Controlled Adjustment Methods for Statistical Tabular Data
Minimum distance controlled tabular adjustment is a recent perturbative approach for statistical disclosure control in tabular data. Given a table to be protected, it looks for the closest safe table, using some particular distance. Controlled adjustment is known to provide high data utility. However, the disclosure risk has only been partially analyzed using theoretical results from optimizati...
Maximum Utility-Minimum Information Loss Table Server Design for Statistical Disclosure Control of Tabular Data
Statistical agencies typically serve a diverse group of end users with varying information needs. Accommodating the conflicting needs for information in combination with stringent rules for statistical disclosure limitation (SDL) of statistical information creates a special challenge. We provide a generic table server design for SDL of tabular data to meet this challenge. Our table server desig...